Abstract

The aim of this work is to estimate a quadratic functional of an unknown
Wigner function from noisy tomographic data. The Wigner function can
be seen as the representation of the quantum state of a light beam. The
quadratic functional is estimated from the results of quantum homodyne
measurements performed on identically prepared quantum systems.
We start by constructing an estimator of a quadratic functional of the
Wigner function. We show that the proposed estimator is optimal or
nearly optimal in a minimax sense over a class of infinitely differentiable
functions. Parametric rates are also reached for some values of the
smoothness parameters, and asymptotic normality is established. Then, we
construct an adaptive estimator that does not depend on the smoothness
parameters and prove that it is minimax over some set-ups.
1 Introduction

In quantum mechanics, the quantum state of a system completely describes all
aspects of the system. The instantaneous state of a quantum system encodes the
probabilities of its measurable properties, or "observables" (examples of
observables include energy, position, momentum and angular momentum). Generally,
quantum mechanics does not assign deterministic values to observables. Instead,
it makes predictions about probability distributions; that is, the probability of
obtaining each of the possible outcomes from measuring an observable.
We have two mathematical representations of a quantum state: the density
matrix $\rho$ and its associated Wigner function $W_\rho$. The density matrix
$\rho$, which completely describes a quantum state, is Hermitian, positive and of
trace one. It can be finite or infinite dimensional. Equivalently, the
corresponding Wigner function $W_\rho : \mathbb{R}^2 \to \mathbb{R}$ may be
defined. In general, $W_\rho$ is regarded as a generalized probability density,
integrating to plus one over the whole plane. It does not satisfy all the
properties of a proper probability density as it can, and normally does, go
negative for states which have no classical model. It also satisfies certain
intrinsic positivity constraints in the sense that it corresponds to a
density matrix.

In this paper we address the problem of estimating the quadratic functional
$d^2 = \int W_\rho^2$ of the Wigner function of a monochromatic light in a cavity
prepared in the state $\rho$, using Quantum Homodyne Tomography (QHT)
measurements performed on independent, identically prepared systems. Quantum
Homodyne Detection (QHD) was first put into practice by Smithey et al. (1993)
[16]; we detail this technique in Section 2.2.
We study the quantity $d^2 = \int W_\rho^2$, which is of interest in itself as a
physical measure of the purity of a quantum state. It allows us to distinguish
pure states from mixed states, as it always equals $1/(2\pi)$ for pure states.
A state is called pure if it cannot be represented as a mixture (convex
combination) of other states, i.e., if it is an extreme point of the convex set
of states. All other states are called mixed states.
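
As a quick sanity check of the value $1/(2\pi)$, consider the vacuum state. The computation below is only a worked illustration; it assumes the normalization $W(q, p) = \pi^{-1} e^{-q^2 - p^2}$ for the vacuum Wigner function, which is consistent with the Fourier transform $e^{-\|(u,v)\|_2^2/4}$ listed for this state in Table 3.

```latex
\[
d^2 = \int_{\mathbb{R}^2} W^2(q,p)\,dq\,dp
    = \frac{1}{\pi^2}\int_{\mathbb{R}^2} e^{-2(q^2+p^2)}\,dq\,dp
    = \frac{1}{\pi^2}\cdot\frac{\pi}{2}
    = \frac{1}{2\pi}.
\]
```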

The QHD technique gives measurements of the electric and magnetic fields
$(Q, P)$ of the studied laser for some phase $\Phi$. In the ideal case, we
would observe the random variable $(X, \Phi) = (\cos(\Phi) Q + \sin(\Phi) P, \Phi)$,
where $\Phi$ is chosen independently of $(Q, P)$ and uniformly in the interval
$[0, \pi]$. In our paper we do not consider the ideal data $(X, \Phi)$ but the
noisy observations $(Y, \Phi)$, where $Y$ is the sum of the random variable $X$
and a Gaussian random variable $\xi$. We assume that the unknown function
$W_\rho$ belongs to $\mathcal{A}(\alpha, r, L)$, a class of super smooth
functions with $\alpha > 0$, $0 < r \le 2$ and $L > 0$, which will be defined
later. Those classes are similar to those of Cavalier (2000) [7], for $r = 1$
and functions defined on $\mathbb{R}^d$; Butucea and Tsybakov (2007) [6], on
$\mathbb{R}$; and Butucea et al. (2007) [5], on $\mathbb{R}^2$.

The study of quadratic functionals started with Bickel and Ritov (1988) [2],
who considered the problem of estimating the integral of the square of a
derivative of a probability density function and obtained nonparametric rates.
Their results were extended by Birgé and Massart (1995) [3] to the estimation
of more general functionals, for which they established nonparametric lower
bounds. The study of general functionals was completed by Kerkyacharian and
Picard (1996) [12] for minimax rates. Laurent (1996) [14] gave efficient
estimation of some functionals of a density function at the parametric rate.
The problem of adaptive estimation of general functionals has been considered
by Tribouley (2000) [18] in the classical white noise model.

In the convolution model, Butucea (2004) [4] estimated a quadratic functional
of a density on $\mathbb{R}$ and applied it to the goodness-of-fit test in $L_2$
distance. In our paper, the first difficulty is that we do not deal with a
proper probability density function but with a quasiprobability density.
Moreover, note that our problem is a double inverse problem, as we observe the
Radon transform of $W_\rho$ (as in PET) together with a convolution (white
noise).

Inverse problems have been extensively studied in the mathematical literature.
From a positron emission tomography (PET) perspective, the problem of
estimating a probability density on $\mathbb{R}^2$ from tomographic data
$(X_k, \Phi_k)$ has been treated by Korostelev and Tsybakov (1993) [13] and
Johnstone and Silverman (1990) [11]. Cavalier (2000) [7] also considered the
PET model and obtained an estimator of a multi-dimensional density function
which is asymptotically sharp minimax, i.e. it achieves the optimal rate of
convergence and attains the best constant for the minimax risk.

¹ We refer the interested reader to Artiles et al. (2005) [1] for further
details on the physical background.

The estimation of the Wigner function $W_\rho$ has been treated by Guţă and
Artiles (2006) [10] in the noise-free case. Our noisy model has been studied in
a parametric framework by D'Ariano and in a nonparametric framework, for the
estimation of the Wigner function, by Butucea et al. (2007) [5]. We propose
to estimate the integral of the square of the Wigner function rather than the
function itself.

Other problems have been considered in the context of tomography: Goldenshluger
and Spokoiny (2006) [8] considered the problem of recovering edges of an image
from noisy tomographic data in a white noise model and reached nearly optimal
rates. Recovering boundaries in models that involve indirect observations in
the $d$-dimensional Euclidean space $\mathbb{R}^d$ has been discussed recently
in Goldenshluger and Zeevi (2006) [9]. We note that a Wigner function cannot
have a bounded support.

The main contributions of this paper are the following. We propose a method
for estimating a quadratic functional of a generalized probability density,
which may take negative values, from indirect and noisy observations, with a
view to detecting pure states and mixed states. It is shown that the proposed
estimator is optimal or nearly optimal in a minimax sense, depending on the
smoothness parameter $r$ of the class $\mathcal{A}(\alpha, r, L)$. Moreover, an
adaptive estimator is constructed which attains optimal rates. Another main
interest of the estimation of $d^2$ is the important application to
goodness-of-fit testing in $L_2$-norm in quantum statistics. This means that
physicists want to test whether they produced a laser in the quantum state
$\rho_0$ or something different. This can be done via the Wigner functions as
follows:

$\|W_\rho - W_{\rho_0}\|_2 \ge C \cdot \varphi_n$,

where $\varphi_n$ is a sequence which tends to 0 when $n \to \infty$; it is the
testing rate. We can devise a test statistic based on the estimator of
$d^2 = \int W_\rho^2$ constructed in this paper. Similarly to Butucea (2004)
[4], we conjecture that the testing rates are of the same order as the
nonparametric ones found in this paper.

The rest of the paper is organized as follows. In Section 3 we formulate
the statistical model and introduce notation and properties of quantities of
interest. In Section 4 we construct an estimator of the quadratic functional
of the unknown Wigner function, along with the bias-variance decomposition.
Our main theoretical results are presented in Section 5. In Section 6 we give
some examples of quantum states. Proofs of upper and lower bounds are given
in Sections 7, 8 and 9.
2 Preliminaries



2.1 Definition 

We study this problem in a minimax framework. Let $d_n^2$ be an estimator of
$d^2 = \int W_\rho^2$ based on the indirect noisy observations $(Y_i, \Phi_i)$,
$i = 1, \dots, n$, as announced above. We measure the accuracy of $d_n^2$ by the
maximal risk

$\mathcal{R}(d_n^2; \mathcal{A}(\alpha, r, L)) = \sup_{W_\rho \in \mathcal{A}(\alpha, r, L)} E_\rho[|d_n^2 - d^2|^2]$

over the class $\mathcal{A}(\alpha, r, L)$. Here $E_\rho$, $P_\rho$ denote the
expected value and probability when the true underlying quantum state is
$\rho$. The minimax risk is defined by

$\mathcal{R}^*(\mathcal{A}(\alpha, r, L)) = \inf_{d_n^2} \mathcal{R}(d_n^2; \mathcal{A}(\alpha, r, L))$,

where the infimum is taken over all possible estimators $d_n^2$ of the quadratic
functional of the Wigner function $W_\rho$.

Let $\varphi_n$ be a positive sequence. An estimator $d_n^2$ is optimal in a
minimax sense

• if it satisfies the following upper bound

$\limsup_{n \to \infty} \varphi_n^{-2} \mathcal{R}(d_n^2; \mathcal{A}(\alpha, r, L)) \le C_u$, (1)

• and if the following lower bound is satisfied

$\liminf_{n \to \infty} \inf_{d_n^2} \varphi_n^{-2} \mathcal{R}(d_n^2; \mathcal{A}(\alpha, r, L)) \ge C_l$, (2)

where the infimum is taken over all possible estimators of the quadratic
functional of the Wigner function $W_\rho$. Then, $\varphi_n^2$ is called the
optimal rate in a minimax sense. Our aim is to find rate optimal estimators of
$d^2$ and to establish asymptotics of minimax risks for some classes of Wigner
functions $\mathcal{A}(\alpha, r, L)$. We rely on Butucea et al. (2007) [5], who
derived rate optimal pointwise and adaptive estimators of $W_\rho$ (instead of
$\int W_\rho^2$ in our case) from indirect noisy observations.

2.2 Quantum Homodyne Tomography 

The theoretical foundation of quantum state reconstruction was outlined by
Vogel and Risken (1989) [19] and inspired the first experiments determining
the quantum state of a light field, initially with optical pulses, by Smithey
et al. (1993) [16] and Smithey et al. [17].

Physicists prepare a monochromatic laser in state $\rho$ in a cavity. In order
to study it, one takes measurements by quantum homodyne tomography (QHT).
This technique, sketched in Figure 1, consists in mixing the laser to be
studied with a reference laser of high intensity $|z| \gg 1$, called the local
oscillator. The resulting beam is then split into two, and two photodetectors
each measure one of the two beams $(I_1, I_2)$. One measures the difference of
the intensities of the two beams and rescales it by the intensity $|z|$, which
gives $X$. Thus, for the phase of the cavity pulse chosen to be $\Phi$, data
$(X, \Phi)$ should be obtained. It is widely known in the physics literature
(see Leonhardt (1997) [15]) that an additive Gaussian noise is mixed with the
ideal data $X$, giving, for known efficiency $\eta$, the data $Y$.
Figure 1: QHT measurement



3 Statistical context 
3.1 Problem formulation 

In the present paper we estimate the integral of the square of the Wigner
function from measurements performed on $n$ identical quantum systems, where
the Wigner function is assumed to be a joint generalized density of two
variables $P$ and $Q$, $W_\rho : \mathbb{R}^2 \to \mathbb{R}$. It may take
negative values but it integrates to plus one over the whole plane. For further
information on the Wigner function, we invite readers to refer to the paper by
Artiles et al. [1].

Our statistical problem can be seen as follows: consider
$(X_1, \Phi_1), \dots, (X_n, \Phi_n)$ independent identically distributed random
variables with values in $\mathbb{R} \times [0, \pi]$. The probability density
of $(X, \Phi)$ equals the Radon transform $\mathfrak{R}[W_\rho]$ of the Wigner
function with respect to the measure $\lambda/\pi$, where $\lambda$ is the
Lebesgue measure on $\mathbb{R} \times [0, \pi]$.

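$p_\rho(x|\phi) := \mathfrak{R}[W_\rho](x, \phi) = \int_{-\infty}^{\infty} W_\rho(x \cos\phi + t \sin\phi, x \sin\phi - t \cos\phi)\,dt$ (3)

and $X$ has density $p_\rho(x|\phi)$. As we announced in the introduction, we do
not observe the ideal data $(X_\ell, \Phi_\ell)$, $\ell = 1, \dots, n$, but a
degraded noisy version $(Y_1, \Phi_1), \dots, (Y_n, \Phi_n)$,

$Y_\ell := \sqrt{\eta}\, X_\ell + \sqrt{(1 - \eta)/2}\, \xi_\ell$, (4)

with $\xi_\ell$ standard Gaussian random variables independent of all
$(X_k, \Phi_k)$, and $0 < \eta < 1$ a known parameter. The parameter $\eta$ is
called the detection efficiency, and $1 - \eta$ represents the proportion of
photons which are not detected due to various losses in the measurement
process. We denote by $p_\rho^\eta(x, \phi)$ the density of $(Y_\ell, \Phi_\ell)$.
Thus, $p_\rho^\eta(x, \phi)$ is the convolution of the density
$\frac{1}{\sqrt{\eta}} p_\rho(\frac{x}{\sqrt{\eta}}, \phi)$ with the density of a
centered Gaussian distribution having variance $(1 - \eta)/2$. We assume
that the unknown Wigner function $W_\rho$ belongs to a class
$\mathcal{A}(\alpha, r, L)$ of infinitely differentiable functions. For
$0 < r \le 2$, $\alpha > 0$ and $L > 0$ define

$\mathcal{A}(\alpha, r, L) = \{ W_\rho : \int |\widetilde{W}_\rho(u, v)|^2 e^{2\alpha \|(u, v)\|_2^r}\,du\,dv \le (2\pi)^2 L \}$, (5)

where $\widetilde{W}_\rho$ denotes the Fourier transform of $W_\rho$ and
$\|(u, v)\|_2 = \sqrt{u^2 + v^2}$ is the Euclidean norm.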
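
To make the observation model (4) concrete, here is a minimal simulation sketch. It is not part of the original procedure: the function name `simulate_noisy_homodyne_vacuum` is hypothetical, and the state is assumed to be the vacuum, for which the ideal quadrature $X$ is taken to be $N(0, 1/2)$ whatever the phase (this follows from the vacuum Wigner function $\pi^{-1} e^{-q^2 - p^2}$, itself an assumption on the normalization).

```python
import math
import random


def simulate_noisy_homodyne_vacuum(n, eta, seed=0):
    """Draw n noisy homodyne observations (Y, Phi) for the vacuum state.

    Assumption (not from the paper): for the vacuum state the ideal
    quadrature X is N(0, 1/2) for every phase.
    """
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        phi = rng.uniform(0.0, math.pi)      # phase, uniform on [0, pi]
        x = rng.gauss(0.0, math.sqrt(0.5))   # ideal quadrature X
        xi = rng.gauss(0.0, 1.0)             # standard Gaussian noise
        # Noisy observation, following model (4).
        y = math.sqrt(eta) * x + math.sqrt((1.0 - eta) / 2.0) * xi
        data.append((y, phi))
    return data
```

Under these assumptions $Y$ has variance $\eta/2 + (1 - \eta)/2 = 1/2$ for every $\eta$, which gives a simple check of a simulated sample.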
3.2 Properties of Wigner functions and remarkable equations

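In this paragraph we state some very useful properties of the Wigner function.

Fourier transforms A remarkable relation links the Fourier transform of the
Wigner function to the Fourier transform of its Radon transform. If we denote

$\widetilde{W}_\rho(w) := \mathcal{F}_2[W_\rho](w)$,

then

$\widetilde{W}_\rho(t \cos\phi, t \sin\phi) = \mathcal{F}_1[p_\rho(\cdot|\phi)](t) = E_\rho[e^{itX}]$, (6)

where $\mathcal{F}_2$, $\mathcal{F}_1$ denote the Fourier transform w.r.t. two,
respectively one, variables.

Some remarkable equations In Section 8, most of the proofs make extensive
use of the following equations. Since

$E_\rho[e^{itY}] = E_\rho[e^{it\sqrt{\eta} X}] \cdot E[e^{it\sqrt{(1 - \eta)/2}\, \xi}]$,

then

$\mathcal{F}_1[p_\rho^\eta(\cdot|\phi)](t) = \mathcal{F}_1[\frac{1}{\sqrt{\eta}} p_\rho(\frac{\cdot}{\sqrt{\eta}}|\phi)](t) \cdot \widetilde{N}^\eta(t)$ (7)

$= \mathcal{F}_1[p_\rho(\cdot|\phi)](t\sqrt{\eta}) \cdot \widetilde{N}^\eta(t)$, (8)

where $\widetilde{N}^\eta(t)$ denotes the Fourier transform of
$\sqrt{(1 - \eta)/2}\, \xi \sim \mathcal{N}(0; (1 - \eta)/2)$. Then

$\widetilde{N}^\eta(t) := E[e^{it\sqrt{(1 - \eta)/2}\, \xi}] = e^{-t^2 (1 - \eta)/4}$. (9)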
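
The role of these relations in the estimator of Section 4 can be written out in one step. The display below is a short derivation combining (6), (8) and (9), with $\widetilde{W}_\rho$ assumed to denote the Fourier transform of $W_\rho$:

```latex
\begin{align*}
\left|\mathcal{F}_1[p_\rho^\eta(\cdot|\phi)](t)\right|^2
  &= \left|\mathcal{F}_1[p_\rho(\cdot|\phi)](t\sqrt{\eta})\right|^2
     \left|\widetilde{N}^\eta(t)\right|^2 \\
  &= \left|\widetilde{W}_\rho(t\sqrt{\eta}\cos\phi,\, t\sqrt{\eta}\sin\phi)\right|^2
     e^{-t^2(1-\eta)/2}.
\end{align*}
```

Multiplying by $e^{t^2(1-\eta)/2}$ therefore removes the effect of the Gaussian noise on the squared modulus, at the price of an exponentially growing weight in $t$.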

4 Estimation procedure 

We are now able to define the estimation procedure of the quadratic functional
$d^2 = \int W_\rho^2$ of the unknown function $W_\rho$ directly from the data
$(Y_\ell, \Phi_\ell)$. Next we evaluate an upper bound of the maximal risk
uniformly over all Wigner functions in the class $\mathcal{A}(\alpha, r, L)$.

4.1 Kernel estimator 

Let us define our estimator as a U-statistic of order 2:

Definition 1. Let $(Y_\ell, \Phi_\ell)$, $\ell = 1, \dots, n$, be i.i.d. data
coming from the model (4), and $\delta = \delta_n \to 0$ as $n \to \infty$. The
estimator $d_n^2$ can be written

$d_n^2 := \frac{1}{(2\pi)^2 n(n - 1)} \sum_{k \ne \ell} \int_{|t| \le \frac{1}{\delta\sqrt{\eta}}} \int_0^\pi \eta |t| e^{\frac{t^2}{2}(1 - \eta)} e^{itY_k - itY_\ell}\,d\phi\,dt$. (10)

Definition 2. Let $d_n^2$ be the estimator defined in (10), having bandwidth
$\delta > 0$. We call the bias and the variance of the estimator, respectively:

$B(d_n^2) := |E_\rho[d_n^2] - d^2|^2$ and $Var(d_n^2) := E_\rho[|d_n^2 - E_\rho[d_n^2]|^2]$.
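
The following sketch shows how a statistic of the form (10) might be evaluated numerically. It is an illustration under stated assumptions, not the author's implementation: the function name `purity_estimate` is hypothetical, the law of $Y$ is assumed not to depend on the phase (as for the vacuum state), so that the integral over $\phi$ reduces to a factor $\pi$, and the integral in $t$ is approximated by a midpoint rule. The double sum is computed through the identity $\sum_{k \ne \ell} e^{it(Y_k - Y_\ell)} = |\sum_k e^{itY_k}|^2 - n$.

```python
import math
import random


def purity_estimate(ys, eta, delta, grid=200):
    """U-statistic of the form (10) for a phase-independent state (assumption).

    ys    : noisy observations Y_1, ..., Y_n
    eta   : detection efficiency, 0 < eta < 1
    delta : bandwidth; the integral runs over |t| <= 1 / (delta * sqrt(eta))
    grid  : number of midpoint-rule nodes on [0, t_max]
    """
    n = len(ys)
    t_max = 1.0 / (delta * math.sqrt(eta))
    h = t_max / grid
    total = 0.0
    for j in range(grid):
        t = (j + 0.5) * h
        re = sum(math.cos(t * y) for y in ys)
        im = sum(math.sin(t * y) for y in ys)
        # Sum over k != l of exp(i t (Y_k - Y_l)); it is real and even in t.
        pair_sum = re * re + im * im - n
        total += eta * t * math.exp(0.5 * t * t * (1.0 - eta)) * pair_sum * h
    # Prefactor: pi / (2 pi)^2 from the phase integral, times 2 because the
    # integrand is even in t and only t >= 0 is integrated.
    return total / (2.0 * math.pi * n * (n - 1))
```

For vacuum data with $Y \sim N(0, 1/2)$, the expectation of this statistic is $(1 - e^{-1/(2\delta^2)})/(2\pi)$, which tends to the purity $1/(2\pi)$ as $\delta \to 0$; the remaining gap is the truncation bias.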
4.2 Bias-variance decomposition



The following proposition plays an important role in the proof of the upper
bound of the risk, as we split the risk into a bias term and a variance term.

Proposition 1. Let $(Y_\ell, \Phi_\ell)$, $\ell = 1, \dots, n$, be i.i.d. data
coming from the model (4) and let $d_n^2$ be the estimator in (10) (with
$\delta \to 0$ as $n \to \infty$) of the quadratic functional of the Wigner
function $W_\rho$, which lies in the class $\mathcal{A}(\alpha, r, L)$ with
$\alpha > 0$, $L > 0$ and $0 < r \le 2$ defined in (5). Then,

1. for all $0 < r \le 2$

$|E_\rho[d_n^2] - d^2|^2 \le L^2 e^{-4\alpha/\delta^r}$, (11)

2. for all $0 < r < 2$



Var{dl) < ' , " + __ i—g— I^-", 12 

TT^n^ riTT 1 — rj 



3. for $r = 2$ and $\frac{1 - \eta}{2\eta} - 2\alpha > 0$

Varidl) < ' '\ e—^ + —- '— — 13 



4. for $r = 2$ and $\frac{1 - \eta}{2\eta} - 2\alpha < 0$

Var{dl) < ^ V ? '" + " ' 1 • (14) 



8r?V(l-77)2 1 8r;L 



n 4a77 — 1 + 77 
The proof of this proposition is given in Section 8.



5 Main results 

In this section, the first theorem considers the case of nonparametric rates of
convergence of our estimator, which is proven optimal or nearly optimal (as we
lose a logarithmic factor in the lower bound) in the minimax sense. In the
second theorem, our estimator attains the parametric rate $1/n$.

Theorem 1. Let $(Y_\ell, \Phi_\ell)$, $\ell = 1, \dots, n$, be i.i.d. data coming
from the model (4), where the underlying parameter is the Wigner function
$W_\rho$ lying in the class $\mathcal{A}(\alpha, r, L)$, $\alpha > 0$ and
$L > 0$. Then, for $d_n^2$ defined in (10) and according to the definition given
in Section 2.1,

1. for $0 < r < 2$, with $\delta := \delta_{opt}$ the solution of the equation

$\frac{1 - \eta}{2\eta} \frac{1}{\delta_{opt}^2} + \frac{2\alpha}{\delta_{opt}^r} = \log n - (\log\log n)^2$, (15)

we reach the optimal rate $\varphi_n^2$, with $C_u = 1$, $C_l = 1/16$ the
constants defined in (1) and (2), where

$\varphi_n^2 = L^2 e^{-4\alpha/\delta_{opt}^r}$, (16)
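2. for $r = 2$, $\frac{1 - \eta}{2\eta} - 2\alpha > 0$ and by taking
$\delta = \delta^* = \left(\frac{\log n}{2(\alpha + \beta)}\right)^{-1/2}$, where
$\beta = \frac{1 - \eta}{4\eta}$, the rate of convergence is nearly optimal as

$\varphi_n^2 = n^{-\frac{2\alpha}{\alpha + \beta}}$, (17)

is the rate of convergence in the upper bound (1) and

$\varphi_n^2 = (n \log n)^{-\frac{2\alpha}{\alpha + \beta}}$, (18)

is the rate of convergence in the lower bound (2).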

To prove Theorem 1 one has to prove, on the one hand, the upper bound
(Section 7) and, on the other hand, the lower bound (Section 9), according to
the definition given in Section 2.1.
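
The bandwidth equation of Theorem 1 has no closed-form solution in general, but it is easy to solve numerically. The sketch below is only an illustration: it assumes that equation (15) reads $\frac{1-\eta}{2\eta\delta^2} + \frac{2\alpha}{\delta^r} = \log n - (\log\log n)^2$, and the function name `solve_bandwidth` is hypothetical. Since the left-hand side is strictly decreasing in $\delta$, a bracketing step followed by bisection is enough.

```python
import math


def solve_bandwidth(n, eta, alpha, r, tol=1e-12):
    """Solve (1-eta)/(2*eta*d**2) + 2*alpha/d**r = log n - (log log n)**2 for d > 0.

    The form of the equation is an assumption about (15); n must be at least 2.
    """
    target = math.log(n) - math.log(math.log(n)) ** 2
    if target <= 0:
        raise ValueError("n is too small: the right-hand side must be positive")

    def lhs(d):
        return (1.0 - eta) / (2.0 * eta * d * d) + 2.0 * alpha / d ** r

    # lhs decreases from +infinity (d -> 0) to 0 (d -> infinity), so the
    # root is unique: first bracket it, then bisect.
    lo, hi = 1.0, 1.0
    while lhs(lo) < target:
        lo /= 2.0
    while lhs(hi) > target:
        hi *= 2.0
    while hi - lo > tol * hi:
        mid = 0.5 * (lo + hi)
        if lhs(mid) > target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)
```

For instance, `solve_bandwidth(10**6, 0.9, 0.2, 1.0)` returns the bandwidth for a sample of one million observations under these (illustrative) parameter values; larger samples lead to smaller bandwidths.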

Theorem 2. Let $(Y_\ell, \Phi_\ell)$, $\ell = 1, \dots, n$, be i.i.d. data coming
from the model (4), where the underlying parameter is the Wigner function
$W_\rho$ lying in the class $\mathcal{A}(\alpha, r, L)$, $r = 2$, $\alpha > 0$,
$L > 0$ and $\frac{1 - \eta}{2\eta} - 2\alpha < 0$. Then for $d_n^2$ defined in (10)

/ \-i/2 

with S = 5* = [ H-^ejl J ^ ^/jg rate of convergence is parametric: 1^^ = —. 



Moreover, in this case our estimator $d_n^2$ is asymptotically normally
distributed with asymptotic variance

y^ = 2^J J \ti\\t2\e^'^*^E[e^''^]E[e'''^]E[e-'^''+'^^^]dtidt2-4d'^. 

The proof of Theorem 2 is given in Section 7.

Remark 1. We are able to give a more explicit form for the bandwidth, and thus
for the bias term, which is asymptotically equivalent to the rate, according to
the values of $r$. Let $s_n := \frac{\log n}{2\beta}$, where
$\beta := \frac{1 - \eta}{4\eta}$; then we make successive approximations in
(15), starting with $\delta_0$, which we plug back into (15) to find
$\delta_1$, and successively, for all $k \ge 1$, we obtain $\delta_k$. Values
are given in Table 1 and Table 2.



Table 1: Procedure 



Sq Si for all k > 1, Sk 



-1/2 
= Sn 



In the previous theorems, the bandwidth $\delta_{opt}$ depends on the
parameters $\alpha$ and $r$ of the class $\mathcal{A}(\alpha, r, L)$, which may
be difficult to evaluate in practice. However, it is possible to construct an
adaptive estimator which does not depend on these parameters and which attains
the same asymptotic behavior as in
Table 2: Rates of convergence



If r It is enough And the 

belongs to to choose rate is 



r G]0,1] 


6 = 


= <5i 




-4as' 






r e]l,4/3] 


6 . 






-4:as] 




-i-o(l)) 


i2(fc-l) 2fe 1 
' ' fe+lJ 


6 = 


--Sk 




-4as^ 




-i-...+c.-isr^^"(''-^)+o(i)) 



Theorem 1, provided that these parameters lie in a certain set. Note that the
parameter $\eta$ is supposed to be known. Define two sets of parameters

$\Theta_1 = \{(\alpha, r, L) : \alpha > 0, L > 0, 0 < r < 1\}$

$\Theta_2 = \{(\alpha, r, L) : 0 < \alpha \le \alpha_0, L > 0, r = 1\}$, $\alpha_0 > 0$.

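Theorem 3. Let $(Y_\ell, \Phi_\ell)$, $\ell = 1, \dots, n$, be i.i.d. data coming
from the model (4). For $\delta = \delta_{ad}^i$, $i = 1, 2$, let $d_{ad,n}^2$ be
the estimator defined by

$d_{ad,n}^2 := \frac{1}{(2\pi)^2 n(n - 1)} \sum_{k \ne \ell} \int_{|t| \le \frac{1}{\delta_{ad}^i \sqrt{\eta}}} \int_0^\pi \eta |t| e^{\frac{t^2}{2}(1 - \eta)} e^{itY_k - itY_\ell}\,d\phi\,dt$,

with $\delta_{ad}^1 = \left(\frac{2\eta \log n}{1 - \eta} - \sqrt{\frac{2\eta \log n}{1 - \eta}}\right)^{-1/2}$,
$\delta_{ad}^2 = \left(\frac{2\eta \log n}{1 - \eta} - \frac{4A\eta}{1 - \eta} \sqrt{\frac{2\eta \log n}{1 - \eta}}\right)^{-1/2}$,
$A > \alpha_0$. Then, for all $(\alpha, r, L) \in \Theta_i$, $i = 1, 2$, respectively,

$\limsup_{n \to \infty} \sup_{W_\rho \in \mathcal{A}(\alpha, r, L)} \varphi_n^{-2} E_\rho[|d_{ad,n}^2 - d^2|^2] \le C_i$,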

where $\varphi_n^2$ is the rate defined in (16) and the constants are
respectively $C_1 = 1$ and C2 = exp - ^).

The proof of the adaptive case is given in Section 7.



6 Examples 

Table 3 shows five examples of pure quantum states and one example of a
mixed state which can currently be created in the laboratory. Among the pure
states we consider the vacuum state, which is the pure state of zero photons;
the single photon state; and the coherent state, which characterizes the laser
pulse with an average of $N$ photons. The squeezed states have Gaussian Wigner
functions whose variances in the two directions have a fixed product. Finally,
the well-known Schrödinger's Cat is also a pure state.

Note that for pure states, $d^2 = 1/(2\pi)$. The thermal state is a mixed state
describing equilibrium at temperature equal to $1/\beta$, having a Gaussian
Wigner function with variance increasing with the temperature. This state is
mixed and here we find $d^2 = \tanh(\beta/2)/(2\pi)$. For these examples of
quantum states, the procedure gives fast parametric rates with $r = 2$ and
$\frac{1 - \eta}{2\eta} - 2\alpha < 0$. We can easily check that each Wigner
function belongs to the class $\mathcal{A}(\alpha, 2, L)$ for small enough
values of $\alpha$ (see Table 3).
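
As an illustration of this last claim, the vacuum state can be checked by hand. The computation below is a worked example under two assumptions: the Fourier transform of the vacuum Wigner function is $\widetilde{W}_\rho(w) = e^{-\|w\|_2^2/4}$, and the class (5) is defined through the weight $e^{2\alpha\|w\|_2^r}$ with bound $(2\pi)^2 L$.

```latex
\[
\int_{\mathbb{R}^2} |\widetilde{W}_\rho(w)|^2 e^{2\alpha\|w\|_2^2}\,dw
  = \int_{\mathbb{R}^2} e^{-\left(\frac12 - 2\alpha\right)\|w\|_2^2}\,dw
  = \frac{\pi}{\frac12 - 2\alpha}
  = \frac{2\pi}{1 - 4\alpha},
  \qquad \alpha < \tfrac14 .
\]
```

The integral is finite exactly when $\alpha < 1/4$, and under these assumptions the vacuum state then belongs to $\mathcal{A}(\alpha, 2, L)$ for any $L \ge \frac{1}{2\pi(1 - 4\alpha)}$.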
Table 3: Examples of quantum states



State 



Fourier transform of Wigner 
function Wp{u, v) 



in the class 
Aia,2,L) if 



Vacuum state 
Single photon state 

Schrodinger's Cat Xq > 
Coherent state N eR+ 
Squeezed state N £ M+, ^ e 
Thermal state (3 > 



exp 

1 - 



|^ -Il(y)ll2 ^ 



\(u.v) 



exp 



exp ' ' 



(^cos(2uXo) + e^^o cosh(Xow)^ 



4 ^ 



exp 1 4 - 4 

-Il(",'")ll2 



iVNv 

2? 



exp I 



4(tanli(/3/2))^ 



) 



a < 1/4 
a < 1/4 

a < 1/4 
a < 1/4 
a < e2V4 



a < 



(tanli(/3/2))^ 



Our previous results show that our estimator of the purity attains the
parametric rate $1/n$ if $\eta > \frac{1}{1 + 4\alpha}$. This is not restrictive
at all. In practice, physicists usually find $\eta > 0.8$, and more often $\eta$
is close to 0.9 or 0.95. Thus, by choosing $\alpha$ as close to its upper bound
(in Table 3) as possible we make sure that our estimator attains the
parametric rate.
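
The threshold on the efficiency can be made explicit. The display below is a short computation, assuming that the condition of Theorem 2 reads $\frac{1-\eta}{2\eta} - 2\alpha < 0$ (equivalently, that the denominator $4\alpha\eta - 1 + \eta$ appearing in the variance bound (14) is positive):

```latex
\[
\frac{1-\eta}{2\eta} < 2\alpha
\iff 1 - \eta < 4\alpha\eta
\iff \eta > \frac{1}{1 + 4\alpha},
\qquad\text{and}\qquad
\frac{1}{1+4\alpha} \downarrow \frac12 \quad\text{as } \alpha \uparrow \tfrac14 .
\]
```

Under this reading, for the states of Table 3 constrained by $\alpha < 1/4$, any efficiency above $1/2$ is compatible with some admissible $\alpha$, which covers the values of $\eta$ reported in practice.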



7 Proof of the upper bounds of theorems 

Sketch of proof of upper bound in Theorem 1-(16) For $0 < r < 2$ and
by (11) and (12)

7r^77"' 777r 1 — 77 

Cvi 1 Cv2 1-'' 1 2q 

= — — e ^ ~\ e sr- ^ 

n 

On the one hand, we select the bandwidth 5* as 

5* ^ arginf l^e^I^ + ^Xl^^^j.-^ + Cse''-"''] , 

(5>0 [ 77^ 71 J 



by taking derivatives, 5* is a positive real number satisfying 

+ l0g((5*'-2) =log77 



277 (5*2 J* 

and we notice that B{d^) ~ 5^~'^Var{d'^). So the rate of convergence for the 
upper bound is given by the bias i.e. = B{d'^^)[\ + o{l)). On the other hand, 
we show that by taking 5 :— Sopt the unique solution of the equation 



1 — 77 1 2a , , ,0 

o„ .2 + = logn - (log log 77) 

"opt "opt 
we obtain the same results. We find B{d^) ^ 6^ ^Var{d^) for S* . 

^Ipt' / 1-77 -2a \ 5l~^ f 4a 



(loglogn)2 y 5, 
(logn/(2/3))('"'')/' /-4a 



opt 



(loglogn)2 ^^"^ySlpt 
4a \ 
<5!~ 



0(1) exp ( 



^ opt 



Last equalities are due to Lemma 8 from Butucea and Tsybakov [6]. We note
that the variance term with $\delta_{opt}$ is bigger than the variance term with
$\delta^*$, but these terms are asymptotically negligible w.r.t. the bias ones.
This improvement does not appear in the main term of the asymptotics. Then we
conclude $\varphi_n^2 = L^2 \exp(-4\alpha/\delta_{opt}^r + o(1))$. The lower
bound is proven in the last section.



Sketch of proof of upper bound in Theorem 1-(17) For $r = 2$ and
$\frac{1 - \eta}{2\eta} - 2\alpha > 0$, we have by (11) and (13):

TT^n^ TiTT 1 — 77 — 4a77 

To select the bandwidth, we choose 5 — 5* as solution of 

5* ^ arginf |%e^^ + ^e^^-^")^ + C^e-^"/^^ 1 
<5>o y n-^ n J 

By taking derivatives, we found 5*, a positive real number satisfying = 



i^v^n^ we get the rate (/j^ = n 2-7" . The proof of the lower bound is in last 



27, 



section. 

Proof of the parametric rate in Theorem 2 For $r = 2$ and
$\frac{1 - \eta}{2\eta} - 2\alpha < 0$ we have by (11) and (14):

E[\dl - d 



TT^n^ 4a77 — 1 + r; n 



And we can write by taking — ^'^^^ 

1-77 1 

sup E[\dl-d^\''] < + <Cy-+CBn(T^ 

WpeA{a.2,L) n n 

< Cv-{l + o{l)). 
n 

So we find a parametric rate. The proof of the asymptotic normality is in
Section 8.3.
Proof of upper bound in Theorem 3 Our proof is based on results of
Butucea and Tsybakov [6]. Define $\beta := \frac{1 - \eta}{4\eta}$.



Over the set $\Theta_1$

As < r/2 < 1/2 it is easy to remark -(^- y^)"/' > -j^\f^foT n 

large enough, and thus exp (^(Jt^) — ^xp ^— 2ay^^^^^ ■ On the other hand 
the first and second variance terms found in (fT2l) are equal respectively 

1 / 2a 2a \ I \ogn I logn /logn 

■ exp ,,1 - TTT^ exp -2aW — 2a' 



{51,? {Sl,yj " V 2a I 2a V 2a 



Therefore, with the bandwidth S^j^ the ratio of the bias term found in (fTTj) to 
the first and second variance terms are bounded from below respectively by 

exp ^a(^^- ^ ^ and exp (2a^J^^ . These expressions tend to 

$\infty$ as $n \to \infty$. Thus, the variance terms are asymptotically
negligible w.r.t. the bias term. It remains to check that the bias term with
the bandwidth $\delta_{ad}^1$ is asymptotically bounded by $\varphi_n^2$. For
$n$ large enough

r/2\ 



L^exp -M^)- l-(^) 



r/2^ 



< exp 



Over the set $\Theta_2$

As r = 1 a simple calculation shows that 6opt = (^^^r ~ ^\J^^2a^ ^ 

correct approximation in this case, giving a variance infinitely smaller than the 
bias which is of order 

1/2N 



2 r2 4a \ 2 { A i ^OS'^ " 



exp I -Aa^l ^ + _ (1 + o(l)). 
As for the estimator with bandwidth $\delta_{ad}^2$ we get

= C,L'»p('-4»/!|^+^') (1 + 0(1)). 

Hence the results. 



8 Proof of Proposition 1

Main tools Note that extensive use is made of formulae (6), (7), (8) and
(9), and that the Plancherel formula reads

$\int W_\rho^2(p, q)\,dp\,dq = \frac{1}{(2\pi)^2} \int |\widetilde{W}_\rho(u, v)|^2\,du\,dv$. (19)



8.1 Proof of Proposition 1-(11)

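We write $E[\cdot]$ instead of $E_\rho[\cdot]$. Because $Y_k$ and $Y_\ell$ are
i.i.d. for all $k \ne \ell$,

$E[d_n^2] = \frac{1}{(2\pi)^2 n(n - 1)} \sum_{k \ne \ell} \int_{|t| \le \frac{1}{\delta\sqrt{\eta}}} \int_0^\pi \eta |t| e^{\frac{t^2}{2}(1 - \eta)} E[e^{itY_k}] E[e^{-itY_\ell}]\,d\phi\,dt$

$= \frac{1}{(2\pi)^2} \int_{|t| \le \frac{1}{\delta\sqrt{\eta}}} \int_0^\pi \eta |t| e^{\frac{t^2}{2}(1 - \eta)} |\mathcal{F}_1[p_\rho^\eta(\cdot|\phi)](t)|^2\,d\phi\,dt$.

Use (8), (9) and a change of variables $T = t\sqrt{\eta}$, and next (6) and the
polar coordinates $u = T \cos\phi$, $v = T \sin\phi$:

$E[d_n^2] = \frac{1}{(2\pi)^2} \int_{|T| \le \frac{1}{\delta}} \int_0^\pi |T| |\mathcal{F}_1[p_\rho(\cdot|\phi)](T)|^2\,d\phi\,dT = \frac{1}{(2\pi)^2} \int_{\|(u, v)\|_2 \le \frac{1}{\delta}} |\widetilde{W}_\rho(u, v)|^2\,du\,dv$. (20)

So by combining (19) and (20), and defining $w := (u, v)$,

$|E[d_n^2] - d^2|^2 = \left| \frac{1}{(2\pi)^2} \int_{\|w\|_2 > \frac{1}{\delta}} |\widetilde{W}_\rho(w)|^2\,dw \right|^2 \le \left| \frac{1}{(2\pi)^2} e^{-2\alpha/\delta^r} \int |\widetilde{W}_\rho(w)|^2 e^{2\alpha \|w\|_2^r}\,dw \right|^2 \le L^2 e^{-4\alpha/\delta^r}$,

as $W_\rho$ belongs to $\mathcal{A}(\alpha, r, L)$.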
8.2 Proof of Proposition 1-(12)-(13)-(14)



First we center variables 

dl - E[dl] ^ 



E 



An^nin — 1) ^ /ifi<j_ in 



E 



An'^nln ~ 1) ^ in 



^\t\ 



-itYi 



Let us define $Z_k(t) := e^{itY_k} - E[e^{itY}]$, and $\overline{Z}_k$ its complex conjugate; then:
1 



di~E\di] = 



(27r)2 \n(n-l)^^y|,|<_^yo 



-E 



l*l<^-^o 



V\t\e 



V(i-'?)/ 



Zj£;[e-"^] + ZjE[e'*^])d<j)dt 



Denote by $J_1$ and $J_2$ respectively the first and the second term of the
previous sum; we then have

$Var(d_n^2) = E[(d_n^2 - E[d_n^2])^2] = E[J_1^2] + E[J_2^2] + 2E[J_1 J_2]$.

Note that the third term of the previous sum vanishes:

E[JlJ2] 



(20) 



(2 



Tyltle^^i-") {E[e-'*^]Zj + E[e'*^]Zj) d(f>dt 



= 0. 



This follows by noticing that $E[Z_j] = 0$ for all $j = 1, \dots, n$, and because
there always exists an index $j \ne k$, so that $Z_j$, $Z_k$ are independent, or
an index $j \ne \ell$, so that $Z_j$, $Z_\ell$ are independent.
Now study



1 



167r4n2(n- 1)2 
1 

167r4n2(n- 1)2 



E E^ 
Note that, as soon as one of the indices $k_1, \ell_1, k_2, \ell_2$ is different
from the others, the expected value is 0. Thus,



167r%2(„_ 1)2 



\t\< 



[ ri\t\e'^^^-'^^ZkZid(l)dt] 
) 



l*l<i^-^o 



r7|i|e'r(i-'')ZfeZ^#dt|2 



E[Jl] 



■Ie 



167r4n(n - 1) 2 



1 

"l67r%(n- 1) 



l*l<j^-^o 



??|i|eV(i-'')ZiZ2#dt 



E 



Tj\t\e^^^-''^ZiZ2d(l)dt\^ 



1 



167r4n(n - 1) 
1 



3ie 



r]\t\e'^'^^-'^'>ZiZ2d<pdt 



+ 



167r4n(n - 1) 



E 



[ l\\t\e'^^^-'^^Z^Z2dct>dtf 
J\t\<-^ Jo 



By noticing that $|\Re e(z)| \le |z|$ and using the fact that $|Z_k| \le 2$, we get
1 



E[Ji] < 



< 



< 



SiT^nin - 1) 
27r2 ' 



E 



[ r r)\t\e'^^^-'>^ZiZ2d(l)dtf 



V\t\e 



1(1- 



77^(1 — r])^ h 



1 

2 *i2 



exp 



1 — 77 



(21) 



15 



Noticing that $(Z_k)_k$ are i.i.d. and centered, we then have by developing the
square:



E[J'i] = E 



1 



167r4n 
1 



E 



-E 



< 



1 



77 1 i I e V ( 1 23? e (£; [e"'*'] Zfe ) d</.(it 
T^\t\e'i^^~'^h^e{E[e^'^]Zi)d(j)dt 
7? I i I e 3? e (£; [e'*^] Zi ) d<^di ^ 



(22) 



as $|\Re e(z)| \le |z|$ and $|Z_1| \le 2$. Then use successively (7) and (9),
next (8), and a change of variables $T = t\sqrt{\eta}$



1 1 



1 1 

1 1 



l*l<i^^o 



|r|e?|(i-'')|Fibp(./0)](r)|d0dT^ 



Then, by (6), and next using the polar coordinates $u = T \cos\phi$, $v = T \sin\phi$
1 1 ' " 



1 1 



I r I e —( 1 I (T cos T sin 0) I d0dTj 



eVll("'")ll2|W'p(u,?;)|dud?; 



\J||(M,tl)||2<l/5 

Define $z := (u, v)$ and use the Cauchy-Schwarz inequality and (5)



1 1 



eVll^ll2|Vt/p(;2)|d2 



||2||2<l/5 
8L f^/^ 1-T| j2 n„.jr 

mr 



(23) 
1. For $0 < r < 2$ and according to Lemma 6 of Butucea and Tsybakov [6] we
get:

— / te~* dt < !_e— I^-2ap.^ 24 

mr Jq mr 1 — rj 

The expressions (21) and (23) together with (24) conclude (12).

2. For $r = 2$ and $\frac{1 - \eta}{2\eta} - 2\alpha > 0$ and according to Lemma 6 of [6] we get:



8L f^'^ i^e-2ae , 8L 



2(i^-2a) 



(^-2q)45- 



< 



8L 



mr 1 — rj — Aarj 
The expressions (21) and (23) together with (25) conclude (13).



(25) 



3. For $r = 2$ and $\frac{1 - \eta}{2\eta} - 2\alpha < 0$ we have:



4i 1 



< 



8r]L 



2a — n Aa-q - 1 + ?/ ?i 
The expressions (21) and (23) together with (26) conclude (14).

8.3 the asymptotic normality 



(26) 



Let r = 2, - 2a < and 5 = <5* = (^j^) 



-1/2 



V^(d2 - d?) = ^{dl - E[dl]) + ^^^B(d^. 



The term -y/ nB{d'^) < y/nLe T^W tends to as n — > 00. 

Moreover $\sqrt{n}(d_n^2 - E[d_n^2]) = \sqrt{n}(J_1 + J_2)$, where $J_1$, $J_2$
are centered and were defined in Section 8.2. It has been shown in (26) that the
dominating term in the variance $E[(d_n^2 - E[d_n^2])^2]$ is given by $J_2$,
bounded in (22). That means $nE[J_1^2] = o(1)$ as $n \to \infty$ and
$\sqrt{n} J_1 \to 0$ in probability. Thus, the asymptotic normality is
given by the term $\sqrt{n} J_2$.

As J2 = ^E,/|t|<^r»/l^|e'^''""n^J^[e"'*'']+^J-^[e'*''])«''^'^^)' can 
use a classical central limit theorem for i.i.d. random variables with finite vari- 
ance and the asymptotic variance is given by the limit of nE[J2]. Let us study 



lim„^oo nE[Jl 



iE[j\ 



47r4 



E 



5Re 



77|i|e-(i-'')(i;[e'*^]e 



tY^-itY 



E[e''^]E[e-''^])d(t)dt) 



47r4 



E 



3?ef/ r Tj\t\e^'^^-'^^E[e''^]e-''^d(j)dt\ 



1 

'4^ 
On the one hand, we already proved in section ISTH A2 — 4, (E[dl]Y . Therefore, 



lim„^oo Aq 



Wp II 2- On the other hand 



47r4 
1 

4^ 



E 



3?e 



77|t|eV(i-'')£;[e 



itYT-itY 



dipdt 



lie [\'Mh\e^^'-^^E[e^'^^]E[e^'^^] 

■E[e'''^^^+*^^^]d(j)idtid(j)2dt2. 
By changing the variable t into t/ ^ and as Y/ ^ = {X + J S^t 



Ai = ^ 



£;[e-'(*i+*2)i'/v^]dfidt2 



\ti\\t2\e^^*^+''^^E[e'^^^/^]E[e''^^'^] 



1 

4^ 
■E[e 



|tl|<H|t2|<i 

-'i(ti+t2){X + J- 



|ii||i2|e'^(*?+*^)i?[e^*^^^+V^«)]£;[e'*^(^+V^«)] 



]dtidt2 



As X and <^ are independent and since E[e 



" , we get 



1 

4^ 



|ii||i2|e 2., 

|ti|<i-'|t2|<i 

1 

4^2 



^(*?+*2)e-^(t?+t2)e-^(*i+*2)'^[e^ti^] 



/ / \h\\t2\e^'''^E[e''^^]E[e''^^]E[e-''^''+'^-'^^]dhdt2, 



and lim„Ai = 3i^//|ii||t2|e^*^*^£;[e'*i^]i?[e^*^^]i?[e-^(*i+*^)^](itidi2. 
By denoting W = lim„^oo(j4i — A2), we get the result. 



9 Proofs of lower bounds 

In this section, we show the lower bounds of Theorem 1. For that we rely on the
results of Butucea and Tsybakov [6]. They show that the problem of bounding
from below the minimax risk can be reduced to two functions $W_{\rho_0}$ and
$W_{\rho_1}$ depending on a parameter $\delta_n = \delta$ such that
$\delta \to 0$ as $n \to \infty$. The choice of $\delta$ ensures the existence
of the lower bound. The parameter $\delta$ is the unique solution of the
equation

$\frac{1 - \eta}{2\eta} \frac{1}{\delta^2} + \frac{2\alpha}{\delta^r} = \log n + (\log\log n)^2$. (27)

If $0 < r < 2$, notice that it is different from the $\delta$ appearing in the
expression of our estimator defined in (10). And for $r = 2$, we take

$\delta = \left(\frac{\log(n \log n)}{2(\alpha + \beta)}\right)^{-1/2}$, where $\beta = \frac{1 - \eta}{4\eta}$. (28)
We will use the Wigner functions $W_{\rho_0}$ and $W_{\rho_1}$ built by Butucea
et al. (2007) [5] in their first preprint, as well as certain results coming
from this construction. $W_{\rho_0}$ is a fixed function corresponding to the
density matrix $\rho_0$, and $W_{\rho_1}$ is of the form

WpAz) = Wp,{z) + V',iz) and 

Pl= P0 + 

such that $\rho_1$ is a density matrix (positive and trace equal to one) with
Radon transform $p_1$. Note that the function $V_\delta$ is not a Wigner
function of a density matrix but belongs to the linear span of the space of
Wigner functions, and its corresponding matrix is in the linear span of density
matrices. We detail in a later paragraph the construction of $W_{\rho_{0,1}}$,
$\rho_{0,1}$ and $V_\delta$, as well as the results which follow from it. As
stated above, we will use Lemma 4 in Butucea and Tsybakov [6]. Let us suppose
first of all that the following conditions are satisfied:

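$$W_{\rho_0},\ W_{\rho_1} \in \mathcal{A}(\alpha, r, L), \tag{29}$$

$$|d_1^2 - d_0^2| = \big|\,\|W_{\rho_1}\|_2^2 - \|W_{\rho_0}\|_2^2\,\big| \ge 2\phi_n(1 + o(1)), \quad n \to \infty, \tag{30}$$

$$n\chi^2 := n\int_0^\pi\!\!\int \frac{\big(p_1^\eta(y) - p_0^\eta(y)\big)^2}{p_0^\eta(y)}\,dy\,d\phi = o(1), \quad n \to \infty. \tag{31}$$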



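Then we reduce the minimax risk to these two functions $W_{\rho_0}$, $W_{\rho_1}$. Denote by $\hat d_n^2$ an arbitrary estimator of $d_\rho^2 := \|W_\rho\|_2^2$, write $d_j^2 := \|W_{\rho_j}\|_2^2$ and let $P_j^n$ be the law of the sample under $\rho_j$, $j = 0, 1$. Then we get, for some $0 < \tau < 1$,

$$\begin{aligned}
\inf_{\hat d_n^2}\ \sup_{W_\rho \in \mathcal{A}(\alpha,r,L)} E_\rho\big[|\hat d_n^2 - d_\rho^2|^2\big]
&\ge \inf_{\hat d_n^2} \frac12\Big(E_{\rho_0}\big[|\hat d_n^2 - d_0^2|^2\big] + E_{\rho_1}\big[|\hat d_n^2 - d_1^2|^2\big]\Big)\\
&\ge \inf_{\hat d_n^2} \frac12\Big(E_{\rho_0}\big[|\hat d_n^2 - d_0^2|^2\big] + (1-\tau)\,E_{\rho_0}\Big[I\Big(\frac{dP_1^n}{dP_0^n} \ge 1-\tau\Big)|\hat d_n^2 - d_1^2|^2\Big]\Big)\\
&\ge \inf_{\hat d_n^2} \frac12(1-\tau)\,E_{\rho_0}\Big[I\Big(\frac{dP_1^n}{dP_0^n} \ge 1-\tau\Big)\big(|\hat d_n^2 - d_0^2|^2 + |\hat d_n^2 - d_1^2|^2\big)\Big].
\end{aligned}$$

Since $a^2 + b^2 \ge (a - b)^2/2$ for real numbers $a$ and $b$, we can get rid of the estimator.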
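The elementary inequality invoked here can be written out in one line. The following is only a sketch; the shorthand $a$, $b$ and the symbols $\hat d_n^2$, $d_0^2$, $d_1^2$ (an arbitrary estimator and the values of the quadratic functional at $\rho_0$ and $\rho_1$) are assumed notation.

```latex
% Assumed shorthand: a = \hat d_n^2 - d_0^2, \quad b = \hat d_n^2 - d_1^2 (real numbers).
% Since -2ab \le a^2 + b^2,
(a-b)^2 = a^2 - 2ab + b^2 \le 2a^2 + 2b^2
\quad\Longrightarrow\quad
a^2 + b^2 \ge \tfrac12\,(a-b)^2 .
% With the shorthand above, a - b = d_1^2 - d_0^2, so the bound is free of the estimator:
|\hat d_n^2 - d_0^2|^2 + |\hat d_n^2 - d_1^2|^2 \ge \tfrac12\,|d_1^2 - d_0^2|^2 .
```

The right-hand side depends only on the two hypotheses, which is why the infimum over estimators can be dropped at this point.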

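$$\begin{aligned}
\inf_{\hat d_n^2}\ \sup_{W_\rho \in \mathcal{A}(\alpha,r,L)} E_\rho\big[|\hat d_n^2 - d_\rho^2|^2\big]
&\ge \frac14(1-\tau)\,E_{\rho_0}\Big[I\Big(\frac{dP_1^n}{dP_0^n} \ge 1-\tau\Big)\Big]\,|d_1^2 - d_0^2|^2\\
&\ge (1-\tau)\,\phi_n^2\Big(1 - P_{\rho_0}\Big(\frac{dP_1^n}{dP_0^n} - 1 < -\tau\Big)\Big)\\
&\ge (1-\tau)\,\phi_n^2\Big(1 - \frac{1}{\tau^2}\int\Big(\frac{dP_1^n}{dP_0^n} - 1\Big)^2 dP_0^n\Big).
\end{aligned}$$

Supposing that $n\chi^2 \le \tau^4$, the last expression is bounded from below by $(1-\tau)^2\phi_n^2(1+\tau)$. It is therefore enough to check (31) in order to take $\tau = \tau_n \to 0$ as $n \to \infty$, and we obtain a lower bound for the minimax risk of order $\phi_n^2(1 + o(1))$ for any estimator $\hat d_n^2$. Our proof of the lower bounds is quite similar to that of Butucea et al. (2007) [5]. The main difference is the proof of (30), since we bound from below not the Wigner function itself but the quadratic functional of the Wigner function. Nevertheless, for the reader's convenience, we reproduce the key proofs needed to complete the proof of the lower bounds.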
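The way condition (31) enters the last step of the chain can be made explicit. The sketch below assumes i.i.d. observations, so that the likelihood ratio $dP_1^n/dP_0^n$ is a product of $n$ i.i.d. factors. Here $\chi^2$ denotes the chi-square divergence between the single-observation laws, which should agree with the quantity in (31) up to the normalisation of the phase density. The symbols $P_0^n$, $P_1^n$ are assumed notation for the laws of the sample.

```latex
% Assumption: each factor of dP_1^n/dP_0^n has mean 1 and second moment 1 + \chi^2 under P_0.
\int\Big(\frac{dP_1^n}{dP_0^n} - 1\Big)^2 dP_0^n
  = \int\Big(\frac{dP_1^n}{dP_0^n}\Big)^2 dP_0^n - 1
  = (1+\chi^2)^n - 1
  \le e^{n\chi^2} - 1 .
% Chebyshev's inequality controls the probability of the excluded event:
P_{\rho_0}\Big(\frac{dP_1^n}{dP_0^n} - 1 < -\tau\Big)
  \le \frac{1}{\tau^2}\int\Big(\frac{dP_1^n}{dP_0^n} - 1\Big)^2 dP_0^n .
```

In particular, if $n\chi^2 \to 0$ one can choose $\tau = \tau_n \to 0$ slowly enough that the right-hand side of the second display still vanishes.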



9.0.1 The density matrix po 

The main difference with the construction in Butucea et al. (2007) [5] is that they considered two Wigner functions $W_{\rho_1}$ and $W_{\rho_2}$ with $W_{\rho_{1,2}} = W_{\rho_0} \pm V_\delta$, while we consider only the Wigner function $W_{\rho_1} = W_{\rho_0} + V_\delta$. This is because we have to bound from below the quantity $\big|\,\|W_{\rho_1}\|_2^2 - \|W_{\rho_0}\|_2^2\,\big|$ instead of $\|W_{\rho_1} - W_{\rho_0}\|_2^2$, and we must make sure that $W_{\rho_0}$ and $V_\delta$ are positive functions. In this paragraph we recall some results and lemmas of Butucea et al. (2007) [5] about the density matrix $\rho_0$ and its corresponding Wigner function. They constructed a family of density matrices $\rho^{\beta,\xi}$ from which they selected $\rho_0 = \rho^{\beta_0,\xi_0}$, with Radon transform $p^{\beta}_{\xi}$ equal to

where/|(z) = I3{1 - z f / {1 - £_)l{£_ < z < 1), for some < < 1- The Fourier 
transform is 

wliw) = .Fib|](|h||,</.) = I'y^^^p (""^"'i^i^) 

Notice that the Fourier transform is positive and $\widetilde W^{\beta}_{\xi}(0) = 1$. The asymptotic behavior of such functions is studied in Lemmas 1 and 2. Lemma 3 proves that $W^{\beta}_{\xi}$ belongs to the class $\mathcal{A}(\alpha, r, L)$ for $\beta > 0$ small enough and $\xi$ close to 1.

Lemma 1. For all $0 < \beta, \xi < 1$ and $|x| > 1$ there exist constants $c$, $C$ depending on $\beta$ and $\xi$, such that

$$c\,|x|^{-(1+2\beta)} \le p^{\beta}_{\xi}(x, \phi) \le C\,|x|^{-(1+2\beta)}.$$

Lemma 2. For allO < <l we have 

P^-n = + l)n-(i+«(l + 0(1)), n ^ oo. 



Lemma 3. For any {a,r,L) such that < r < 2, there exists anO < < 1 
such that W| belongs to the class A{a,r,L). 

For the proofs of these lemmas we refer to Butucea et al. (2007) [5].



9.0.2 Construction of $V_\delta$ and asymptotic properties of $\tau^{\delta}$

To use the same construction as Butucea et al. (2007) [5], we define the function $V_\delta$ through its Fourier transform



T^mw) Vg{w) = Jg{t) = 2V^^5~(2-'-)/^e"/^'^e-2"l*IV(|ir - ^), 

0^ 
where $t = \|w\|$, and $J$ is a 3-times continuously differentiable function on $\mathbb{R}$ with its first 3 derivatives uniformly bounded on $\mathbb{R}$, such that for any $\lambda > 0$ and any $D > 4\lambda$

$$I(2\lambda \le u \le D - 2\lambda) \le J(u) \le I(\lambda \le u \le D - \lambda), \quad \text{for all } u \in \mathbb{R}.$$

We choose $\delta$ as the solution of (27) when $0 < r < 2$ and as in (28) when $r = 2$. We want $V_\delta$ to belong to the linear span of the space of Wigner functions, with its corresponding matrix belonging to the linear span of density matrices. For that, we use an important property of Wigner functions: the isometry (up to a constant) between the linear span of density matrices and that of Wigner functions with respect to the $L_2$-distances, in particular

$$\|W_{\rho_2} - W_{\rho_1}\|_2^2 := \iint |W_{\rho_2}(p,q) - W_{\rho_1}(p,q)|^2\,dp\,dq = \frac{1}{2\pi}\,\|\rho_2 - \rho_1\|_2^2,$$

for any $\rho_2$, $\rho_1$. Note that because the function $V_\delta$ is invariant under rotations in the plane, the corresponding matrix has all off-diagonal elements equal to 0, and for the diagonal ones we can use the following formula from Leonhardt (1997)

m 

t^^At:^ f L^{f/2)e-''/Hj^,{t)dt. 
Jo 

Since our choice of $V_\delta$ is the same as in Butucea et al. (2007) [5], we have the same asymptotic behavior, stated in the following lemma.

Lemma 4. The matrix $\tau^{\delta}$ has the following asymptotic behavior

$$\tau^{\delta}_{n,n} = O(n^{-5/4})\,o_\delta(1).$$

For the proof of this lemma we refer to Butucea et al. (2007) [5]. It remains to prove conditions (29), (30) and (31) to obtain the lower bound.

9.1 Proof of conditions (29), (30) and (31)

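Proof of (29). From Lemma 3 we get, for any $\beta$ small enough and $\xi$ close to 1, that the Wigner function $W^{\beta}_{\xi}$ belongs to the class $\mathcal{A}(\alpha, r, a^2L)$. Moreover, Lemmas 2 and 3 imply that for any $\beta < 1/4$ the diagonal matrix $\rho_1 = \rho^{\beta,\xi} + \tau^{\delta}$ is positive with trace one for $\delta$ small enough. Thus there exist $\beta_0$, $\xi_0$ such that the corresponding matrix $\rho_1$ is a density matrix and $W_{\rho_0} = W^{\beta_0}_{\xi_0} \in \mathcal{A}(\alpha, r, a^2L)$. Let us prove that $W_{\rho_1} \in \mathcal{A}(\alpha, r, L)$. By the triangle inequality

$$\begin{aligned}
\|\mathcal{F}_2[W_{\rho_1}]\,e^{\alpha\|\cdot\|^r}\|_2 &\le \|\mathcal{F}_2[W_{\rho_0}]\,e^{\alpha\|\cdot\|^r}\|_2 + \|\mathcal{F}_2[V_\delta]\,e^{\alpha\|\cdot\|^r}\|_2\\
&\le 2\pi a\sqrt{L} + \|\mathcal{F}_2[V_\delta]\,e^{\alpha\|\cdot\|^r}\|_2.
\end{aligned}$$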
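Now, by the change of variables $u = t\cos\phi$, $v = t\sin\phi$,

$$\begin{aligned}
\|\mathcal{F}_2[V_\delta]\,e^{\alpha\|\cdot\|^r}\|_2^2 &= \int_0^\pi\!\!\int |t|\,|\mathcal{F}_2[V_\delta](t\cos\phi, t\sin\phi)|^2\,e^{2\alpha|t|^r}\,dt\,d\phi\\
&= \pi\int_{\mathbb{R}} |t|\,|\mathcal{F}_2[V_\delta](t\cos\phi, t\sin\phi)|^2\,e^{2\alpha|t|^r}\,dt\\
&= \pi\int |t|\,|\tilde J_\delta(t)|^2\,e^{2\alpha|t|^r}\,dt.
\end{aligned}$$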



Thus, it is enough to take a = 1 - e-"^/^ to get Wp, G A{a, r, L(l - 6""^/^ 
e-"^)^) cA{a,r,L). 

Proof of (30). Since $W_{\rho_0}$ and $V_\delta$ are positive functions, we get

ll|W^pJl2-||W^Poll2l > T^All / \nM\'dw\^-^\n [ \t\\J~,{t)fdt\ 

, ^ (D-2A+i)l/'- 

> .^2\^Lard^'-e^''/'''2 / te^^af^^ 

. „(D-2A+-L)i/'- 



> 
> 



iie^"/^"^ (e-4-(2A+i) (1 ^ ^ ^_4a(D-2A+i) ^(^^^^ 

20„ (e-8"A _ g-4a(Z3-2A)^ (l + o(l)) 



for n large enough, with <^„ = jipn where ipn is the rate of convergence define 
in (16). Note that we obtain lower bounds with $\delta$ the solution of (27) in the case $0 < r < 2$, while we obtain optimal rates (up to a logarithmic factor) of order
{n\ogn)~'^ for r = 2, with a defined in l(28|) . 
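The role of the positivity of $W_{\rho_0}$ and $V_\delta$ in the first inequality of this proof can be spelled out as follows. This is a minimal sketch assuming, as in the construction above, that $W_{\rho_1} = W_{\rho_0} + V_\delta$ and that both functions are pointwise nonnegative.

```latex
% Assumptions: W_{\rho_1} = W_{\rho_0} + V_\delta, with W_{\rho_0} \ge 0 and V_\delta \ge 0.
\|W_{\rho_1}\|_2^2 - \|W_{\rho_0}\|_2^2
  = 2\iint W_{\rho_0}(z)\,V_\delta(z)\,dz + \|V_\delta\|_2^2
  \ge \|V_\delta\|_2^2 .
```

The cross term is nonnegative under the positivity assumption, so the difference of the two quadratic functionals is bounded from below by the squared $L_2$-norm of the perturbation alone. That norm is then evaluated in the Fourier domain.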



Proof of (31). Let us now bound $n\chi^2$. From Lemma 6.1 we get that $p_0(x) \ge Cx^{-2}$ for all $|x| > 1$. After convolution with the Gaussian density of the noise, the asymptotic decay cannot be faster:

$$p_0^\eta(y) \ge \frac{C}{y^2}, \quad \forall\,|y| > M,$$

for some fixed $M > 0$. Notice that $C$ denotes a constant which may change along the proof. Hence

$$n\chi^2 \le Cn\Big(C(M)\,\|p_1^\eta - p_0^\eta\|_2^2 + \int_{|y|>M} y^2\big(p_1^\eta(y) - p_0^\eta(y)\big)^2\,dy\Big). \tag{32}$$
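The splitting behind (32) can be sketched as follows. The constant $c(M)$ is an assumed positive lower bound for the noisy density on the compact set $\{|y| \le M\}$, and $p_0^\eta$, $p_1^\eta$ are assumed notation for the densities of the noisy observations under $\rho_0$ and $\rho_1$.

```latex
% Assumptions: p_0^\eta \ge c(M) > 0 on \{|y| \le M\}, and p_0^\eta(y) \ge C y^{-2} on \{|y| > M\}.
\int \frac{\big(p_1^\eta(y) - p_0^\eta(y)\big)^2}{p_0^\eta(y)}\,dy
  \le \frac{1}{c(M)} \int_{|y| \le M} \big(p_1^\eta(y) - p_0^\eta(y)\big)^2\,dy
    + \frac{1}{C} \int_{|y| > M} y^2\,\big(p_1^\eta(y) - p_0^\eta(y)\big)^2\,dy .
```

The first integral is at most the full squared $L_2$-distance between the two densities, which gives the first term of (32). The weight $y^2$ in the second integral is the price paid for the polynomial tail of $p_0^\eta$.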
In the first term we have



< CT^^-e^"/^" / ' g^(l-r,)tV(2r,)-4at-^^ 

/•OO 

-'(A+Jr)^'"- 

< CT^-expf-^-i^y (33) 



Let us now consider the second term of the sum



\y\>M 



dt 



In the case $0 < r < 2$, taking $\delta$ as the solution of (27), the expressions in (33) and (34) tend to 0, and together with (32) this concludes the proof. For the case $r = 2$, we prove a weaker form of (31): $n\chi^2 = O(1)$. With $\delta$ given by (28), the expression in (33) tends to 0 while the expression in (34) stays bounded as $n \to \infty$, and together with (32) we get the desired result.